Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 2186 |
| Missing cells | 6958 |
| Missing cells (%) | 16.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.3 MiB |
| Average record size in memory | 642.0 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 7 |
ReporterISO has constant value "CHN" | Constant |
FlowDesc has constant value "Import" | Constant |
PartnerISO has a high cardinality: 222 distinct values | High cardinality |
Key has a high cardinality: 2186 distinct values | High cardinality |
Country Code has a high cardinality: 205 distinct values | High cardinality |
gdp has a high cardinality: 1985 distinct values | High cardinality |
j has a high cardinality: 192 distinct values | High cardinality |
RefYear is highly overall correlated with Period and 6 other fields | High correlation |
Period is highly overall correlated with RefYear and 6 other fields | High correlation |
Cifvalue is highly overall correlated with PrimaryValue | High correlation |
PrimaryValue is highly overall correlated with Cifvalue | High correlation |
year is highly overall correlated with RefYear and 6 other fields | High correlation |
sum_pos_tweets is highly overall correlated with RefYear and 6 other fields | High correlation |
count_tweets is highly overall correlated with RefYear and 6 other fields | High correlation |
sum_political_tweets is highly overall correlated with RefYear and 6 other fields | High correlation |
sum_likes is highly overall correlated with RefYear and 6 other fields | High correlation |
sum_retweeets is highly overall correlated with RefYear and 6 other fields | High correlation |
Country Code has 138 (6.3%) missing values | Missing |
year has 138 (6.3%) missing values | Missing |
gdp has 138 (6.3%) missing values | Missing |
population has 138 (6.3%) missing values | Missing |
j has 268 (12.3%) missing values | Missing |
dist has 298 (13.6%) missing values | Missing |
sum_pos_tweets has 1168 (53.4%) missing values | Missing |
count_tweets has 1168 (53.4%) missing values | Missing |
sum_political_tweets has 1168 (53.4%) missing values | Missing |
sum_likes has 1168 (53.4%) missing values | Missing |
sum_retweeets has 1168 (53.4%) missing values | Missing |
PartnerISO is uniformly distributed | Uniform |
Key is uniformly distributed | Uniform |
Country Code is uniformly distributed | Uniform |
j is uniformly distributed | Uniform |
Key has unique values | Unique |
sum_pos_tweets has 71 (3.2%) zeros | Zeros |
sum_likes has 109 (5.0%) zeros | Zeros |
sum_retweeets has 109 (5.0%) zeros | Zeros |
Reproduction
| Analysis started | 2023-04-10 20:14:04.031203 |
|---|---|
| Analysis finished | 2023-04-10 20:16:55.203460 |
| Duration | 2 minutes and 51.17 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
RefYear
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2016.511 |
| Minimum | 2012 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 2012 |
|---|---|
| 5-th percentile | 2012 |
| Q1 | 2014 |
| median | 2017 |
| Q3 | 2019 |
| 95-th percentile | 2021 |
| Maximum | 2021 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.8745101 |
|---|---|
| Coefficient of variation (CV) | 0.001425487 |
| Kurtosis | -1.2252865 |
| Mean | 2016.511 |
| Median Absolute Deviation (MAD) | 2.5 |
| Skewness | -0.0067243127 |
| Sum | 4408093 |
| Variance | 8.2628085 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 2018 | 220 | |
| 2019 | 220 | |
| 2021 | 220 | |
| 2013 | 219 | |
| 2016 | 219 | |
| 2017 | 219 | |
| 2020 | 219 | |
| 2012 | 218 | |
| 2014 | 217 | |
| 2015 | 215 |
| Value | Count | Frequency (%) |
| 2012 | 218 | |
| 2013 | 219 | |
| 2014 | 217 | |
| 2015 | 215 | |
| 2016 | 219 | |
| 2017 | 219 | |
| 2018 | 220 | |
| 2019 | 220 | |
| 2020 | 219 | |
| 2021 | 220 |
| Value | Count | Frequency (%) |
| 2021 | 220 | |
| 2020 | 219 | |
| 2019 | 220 | |
| 2018 | 220 | |
| 2017 | 219 | |
| 2016 | 219 | |
| 2015 | 215 | |
| 2014 | 217 | |
| 2013 | 219 | |
| 2012 | 218 |
Period
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2016.511 |
| Minimum | 2012 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 2012 |
|---|---|
| 5-th percentile | 2012 |
| Q1 | 2014 |
| median | 2017 |
| Q3 | 2019 |
| 95-th percentile | 2021 |
| Maximum | 2021 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.8745101 |
|---|---|
| Coefficient of variation (CV) | 0.001425487 |
| Kurtosis | -1.2252865 |
| Mean | 2016.511 |
| Median Absolute Deviation (MAD) | 2.5 |
| Skewness | -0.0067243127 |
| Sum | 4408093 |
| Variance | 8.2628085 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 2018 | 220 | |
| 2019 | 220 | |
| 2021 | 220 | |
| 2013 | 219 | |
| 2016 | 219 | |
| 2017 | 219 | |
| 2020 | 219 | |
| 2012 | 218 | |
| 2014 | 217 | |
| 2015 | 215 |
| Value | Count | Frequency (%) |
| 2012 | 218 | |
| 2013 | 219 | |
| 2014 | 217 | |
| 2015 | 215 | |
| 2016 | 219 | |
| 2017 | 219 | |
| 2018 | 220 | |
| 2019 | 220 | |
| 2020 | 219 | |
| 2021 | 220 |
| Value | Count | Frequency (%) |
| 2021 | 220 | |
| 2020 | 219 | |
| 2019 | 220 | |
| 2018 | 220 | |
| 2017 | 219 | |
| 2016 | 219 | |
| 2015 | 215 | |
| 2014 | 217 | |
| 2013 | 219 | |
| 2012 | 218 |
ReporterISO
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 145.2 KiB |
| CHN |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 6558 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CHN |
|---|---|
| 2nd row | CHN |
| 3rd row | CHN |
| 4th row | CHN |
| 5th row | CHN |
Common Values
| Value | Count | Frequency (%) |
| CHN | 2186 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| chn | 2186 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 2186 | |
| H | 2186 | |
| N | 2186 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6558 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2186 | |
| H | 2186 | |
| N | 2186 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6558 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 2186 | |
| H | 2186 | |
| N | 2186 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6558 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 2186 | |
| H | 2186 | |
| N | 2186 |
FlowDesc
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 151.6 KiB |
| Import |
|---|
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 13116 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Import |
|---|---|
| 2nd row | Import |
| 3rd row | Import |
| 4th row | Import |
| 5th row | Import |
Common Values
| Value | Count | Frequency (%) |
| Import | 2186 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| import | 2186 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 2186 | |
| m | 2186 | |
| p | 2186 | |
| o | 2186 | |
| r | 2186 | |
| t | 2186 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10930 | |
| Uppercase Letter | 2186 | 16.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 2186 | |
| p | 2186 | |
| o | 2186 | |
| r | 2186 | |
| t | 2186 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 2186 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13116 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 2186 | |
| m | 2186 | |
| p | 2186 | |
| o | 2186 | |
| r | 2186 | |
| t | 2186 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13116 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 2186 | |
| m | 2186 | |
| p | 2186 | |
| o | 2186 | |
| r | 2186 | |
| t | 2186 |
PartnerISO
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 222 |
|---|---|
| Distinct (%) | 10.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 145.5 KiB |
| W00 | 10 |
|---|---|
| CUW | 10 |
| BES | 10 |
| NCL | 10 |
| VUT | 10 |
| Other values (217) |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 3.0182983 |
| Min length | 3 |
Characters and Unicode
| Total characters | 6598 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | W00 |
|---|---|
| 2nd row | AFG |
| 3rd row | ALB |
| 4th row | DZA |
| 5th row | AND |
Common Values
| Value | Count | Frequency (%) |
| W00 | 10 | 0.5% |
| CUW | 10 | 0.5% |
| BES | 10 | 0.5% |
| NCL | 10 | 0.5% |
| VUT | 10 | 0.5% |
| NZL | 10 | 0.5% |
| NIC | 10 | 0.5% |
| NER | 10 | 0.5% |
| NGA | 10 | 0.5% |
| 19,00Â F | 10 | 0.5% |
| Other values (212) | 2086 |
Length
| Value | Count | Frequency (%) |
| w00 | 10 | 0.5% |
| bdi | 10 | 0.5% |
| dom | 10 | 0.5% |
| brn | 10 | 0.5% |
| alb | 10 | 0.5% |
| dza | 10 | 0.5% |
| and | 10 | 0.5% |
| ago | 10 | 0.5% |
| atg | 10 | 0.5% |
| aze | 10 | 0.5% |
| Other values (213) | 2096 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 488 | 7.4% |
| R | 480 | 7.3% |
| N | 426 | 6.5% |
| M | 418 | 6.3% |
| S | 395 | 6.0% |
| B | 359 | 5.4% |
| L | 348 | 5.3% |
| T | 320 | 4.8% |
| G | 320 | 4.8% |
| C | 289 | 4.4% |
| Other values (25) | 2755 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6422 | |
| Decimal Number | 136 | 2.1% |
| Space Separator | 20 | 0.3% |
| Other Punctuation | 10 | 0.2% |
| Connector Punctuation | 10 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 488 | 7.6% |
| R | 480 | 7.5% |
| N | 426 | 6.6% |
| M | 418 | 6.5% |
| S | 395 | 6.2% |
| B | 359 | 5.6% |
| L | 348 | 5.4% |
| T | 320 | 5.0% |
| G | 320 | 5.0% |
| C | 289 | 4.5% |
| Other values (16) | 2579 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 48 | |
| 0 | 40 | |
| 1 | 29 | |
| 7 | 10 | 7.4% |
| 5 | 9 | 6.6% |
Space Separator
| Value | Count | Frequency (%) |
| Â | 10 | |
| 10 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6422 | |
| Common | 176 | 2.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 488 | 7.6% |
| R | 480 | 7.5% |
| N | 426 | 6.6% |
| M | 418 | 6.5% |
| S | 395 | 6.2% |
| B | 359 | 5.6% |
| L | 348 | 5.4% |
| T | 320 | 5.0% |
| G | 320 | 5.0% |
| C | 289 | 4.5% |
| Other values (16) | 2579 |
Common
| Value | Count | Frequency (%) |
| 9 | 48 | |
| 0 | 40 | |
| 1 | 29 | |
| Â | 10 | 5.7% |
| , | 10 | 5.7% |
| 7 | 10 | 5.7% |
| _ | 10 | 5.7% |
| 10 | 5.7% | |
| 5 | 9 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6588 | |
| None | 10 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 488 | 7.4% |
| R | 480 | 7.3% |
| N | 426 | 6.5% |
| M | 418 | 6.3% |
| S | 395 | 6.0% |
| B | 359 | 5.4% |
| L | 348 | 5.3% |
| T | 320 | 4.9% |
| G | 320 | 4.9% |
| C | 289 | 4.4% |
| Other values (24) | 2745 |
None
| Value | Count | Frequency (%) |
| Â | 10 |
Cifvalue
Real number (ℝ)
| Distinct | 2184 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8120334 × 1010 |
| Minimum | 9 |
|---|---|
| Maximum | 2.6843627 × 1012 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 8953.5 |
| Q1 | 18730681 |
| median | 2.8651095 × 108 |
| Q3 | 3.8242945 × 109 |
| 95-th percentile | 5.302828 × 1010 |
| Maximum | 2.6843627 × 1012 |
| Range | 2.6843627 × 1012 |
| Interquartile range (IQR) | 3.8055638 × 109 |
Descriptive statistics
| Standard deviation | 1.3730537 × 1011 |
|---|---|
| Coefficient of variation (CV) | 7.5774194 |
| Kurtosis | 216.21251 |
| Mean | 1.8120334 × 1010 |
| Median Absolute Deviation (MAD) | 2.8648118 × 108 |
| Skewness | 14.312647 |
| Sum | 3.9611051 × 1013 |
| Variance | 1.8852766 × 1022 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2753 | 2 | 0.1% |
| 71 | 2 | 0.1% |
| 1.818199228 × 1012 | 1 | < 0.1% |
| 3421521148 | 1 | < 0.1% |
| 21774777 | 1 | < 0.1% |
| 33160256 | 1 | < 0.1% |
| 40063 | 1 | < 0.1% |
| 2172086000 | 1 | < 0.1% |
| 81763041 | 1 | < 0.1% |
| 2825845569 | 1 | < 0.1% |
| Other values (2174) | 2174 |
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 13 | 1 | |
| 31 | 1 | |
| 33 | 1 | |
| 34 | 1 | |
| 39 | 1 | |
| 45 | 1 | |
| 71 | 2 | |
| 72 | 1 | |
| 76 | 1 |
| Value | Count | Frequency (%) |
| 2.684362679 × 1012 | 1 | |
| 2.133605397 × 1012 | 1 | |
| 2.079285499 × 1012 | 1 | |
| 2.069567865 × 1012 | 1 | |
| 1.959234625 × 1012 | 1 | |
| 1.949992315 × 1012 | 1 | |
| 1.843792939 × 1012 | 1 | |
| 1.818199228 × 1012 | 1 | |
| 1.679564325 × 1012 | 1 | |
| 1.587920688 × 1012 | 1 |
PrimaryValue
Real number (ℝ)
| Distinct | 2184 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8120334 × 1010 |
| Minimum | 9 |
|---|---|
| Maximum | 2.6843627 × 1012 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 8953.5 |
| Q1 | 18730681 |
| median | 2.8651095 × 108 |
| Q3 | 3.8242945 × 109 |
| 95-th percentile | 5.302828 × 1010 |
| Maximum | 2.6843627 × 1012 |
| Range | 2.6843627 × 1012 |
| Interquartile range (IQR) | 3.8055638 × 109 |
Descriptive statistics
| Standard deviation | 1.3730537 × 1011 |
|---|---|
| Coefficient of variation (CV) | 7.5774194 |
| Kurtosis | 216.21251 |
| Mean | 1.8120334 × 1010 |
| Median Absolute Deviation (MAD) | 2.8648118 × 108 |
| Skewness | 14.312647 |
| Sum | 3.9611051 × 1013 |
| Variance | 1.8852766 × 1022 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2753 | 2 | 0.1% |
| 71 | 2 | 0.1% |
| 1.818199228 × 1012 | 1 | < 0.1% |
| 3421521148 | 1 | < 0.1% |
| 21774777 | 1 | < 0.1% |
| 33160256 | 1 | < 0.1% |
| 40063 | 1 | < 0.1% |
| 2172086000 | 1 | < 0.1% |
| 81763041 | 1 | < 0.1% |
| 2825845569 | 1 | < 0.1% |
| Other values (2174) | 2174 |
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 13 | 1 | |
| 31 | 1 | |
| 33 | 1 | |
| 34 | 1 | |
| 39 | 1 | |
| 45 | 1 | |
| 71 | 2 | |
| 72 | 1 | |
| 76 | 1 |
| Value | Count | Frequency (%) |
| 2.684362679 × 1012 | 1 | |
| 2.133605397 × 1012 | 1 | |
| 2.079285499 × 1012 | 1 | |
| 2.069567865 × 1012 | 1 | |
| 1.959234625 × 1012 | 1 | |
| 1.949992315 × 1012 | 1 | |
| 1.843792939 × 1012 | 1 | |
| 1.818199228 × 1012 | 1 | |
| 1.679564325 × 1012 | 1 | |
| 1.587920688 × 1012 | 1 |
Key
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 2186 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
| W00_2012 | 1 |
|---|---|
| PNG_2018 | 1 |
| NOR_2018 | 1 |
| FSM_2018 | 1 |
| MHL_2018 | 1 |
| Other values (2181) |
Length
| Max length | 12 |
|---|---|
| Median length | 8 |
| Mean length | 8.0182983 |
| Min length | 8 |
Characters and Unicode
| Total characters | 17528 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 2186 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | W00_2012 |
|---|---|
| 2nd row | AFG_2012 |
| 3rd row | ALB_2012 |
| 4th row | DZA_2012 |
| 5th row | AND_2012 |
Common Values
| Value | Count | Frequency (%) |
| W00_2012 | 1 | < 0.1% |
| PNG_2018 | 1 | < 0.1% |
| NOR_2018 | 1 | < 0.1% |
| FSM_2018 | 1 | < 0.1% |
| MHL_2018 | 1 | < 0.1% |
| PLW_2018 | 1 | < 0.1% |
| PAK_2018 | 1 | < 0.1% |
| PAN_2018 | 1 | < 0.1% |
| PRY_2018 | 1 | < 0.1% |
| TGO_2018 | 1 | < 0.1% |
| Other values (2176) | 2176 |
Length
| Value | Count | Frequency (%) |
| 19,00 | 10 | 0.5% |
| x | 10 | 0.5% |
| w00_2012 | 1 | < 0.1% |
| bih_2012 | 1 | < 0.1% |
| bel_2012 | 1 | < 0.1% |
| cpv_2012 | 1 | < 0.1% |
| brb_2012 | 1 | < 0.1% |
| arm_2012 | 1 | < 0.1% |
| bgd_2012 | 1 | < 0.1% |
| bhr_2012 | 1 | < 0.1% |
| Other values (2178) | 2178 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2843 | |
| 0 | 2445 | |
| _ | 2196 | |
| 1 | 1996 | 11.4% |
| A | 488 | 2.8% |
| R | 480 | 2.7% |
| N | 426 | 2.4% |
| M | 418 | 2.4% |
| S | 395 | 2.3% |
| B | 359 | 2.0% |
| Other values (30) | 5482 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8880 | |
| Uppercase Letter | 6422 | |
| Connector Punctuation | 2196 | 12.5% |
| Space Separator | 20 | 0.1% |
| Other Punctuation | 10 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 488 | 7.6% |
| R | 480 | 7.5% |
| N | 426 | 6.6% |
| M | 418 | 6.5% |
| S | 395 | 6.2% |
| B | 359 | 5.6% |
| L | 348 | 5.4% |
| G | 320 | 5.0% |
| T | 320 | 5.0% |
| C | 289 | 4.5% |
| Other values (16) | 2579 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2843 | |
| 0 | 2445 | |
| 1 | 1996 | |
| 9 | 268 | 3.0% |
| 7 | 229 | 2.6% |
| 5 | 224 | 2.5% |
| 8 | 220 | 2.5% |
| 3 | 219 | 2.5% |
| 6 | 219 | 2.5% |
| 4 | 217 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
| Â | 10 | |
| 10 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2196 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11106 | |
| Latin | 6422 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 488 | 7.6% |
| R | 480 | 7.5% |
| N | 426 | 6.6% |
| M | 418 | 6.5% |
| S | 395 | 6.2% |
| B | 359 | 5.6% |
| L | 348 | 5.4% |
| G | 320 | 5.0% |
| T | 320 | 5.0% |
| C | 289 | 4.5% |
| Other values (16) | 2579 |
Common
| Value | Count | Frequency (%) |
| 2 | 2843 | |
| 0 | 2445 | |
| _ | 2196 | |
| 1 | 1996 | |
| 9 | 268 | 2.4% |
| 7 | 229 | 2.1% |
| 5 | 224 | 2.0% |
| 8 | 220 | 2.0% |
| 3 | 219 | 2.0% |
| 6 | 219 | 2.0% |
| Other values (4) | 247 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17518 | |
| None | 10 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2843 | |
| 0 | 2445 | |
| _ | 2196 | |
| 1 | 1996 | 11.4% |
| A | 488 | 2.8% |
| R | 480 | 2.7% |
| N | 426 | 2.4% |
| M | 418 | 2.4% |
| S | 395 | 2.3% |
| B | 359 | 2.0% |
| Other values (29) | 5472 |
None
| Value | Count | Frequency (%) |
| Â | 10 |
Country Code
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 205 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 138 |
| Missing (%) | 6.3% |
| Memory size | 141.4 KiB |
| PAN | 10 |
|---|---|
| VUT | 10 |
| NZL | 10 |
| NIC | 10 |
| NER | 10 |
| Other values (200) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 6144 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AFG |
|---|---|
| 2nd row | ALB |
| 3rd row | DZA |
| 4th row | AND |
| 5th row | AGO |
Common Values
| Value | Count | Frequency (%) |
| PAN | 10 | 0.5% |
| VUT | 10 | 0.5% |
| NZL | 10 | 0.5% |
| NIC | 10 | 0.5% |
| NER | 10 | 0.5% |
| NGA | 10 | 0.5% |
| NOR | 10 | 0.5% |
| FSM | 10 | 0.5% |
| MHL | 10 | 0.5% |
| PLW | 10 | 0.5% |
| Other values (195) | 1948 | |
| (Missing) | 138 | 6.3% |
Length
| Value | Count | Frequency (%) |
| pan | 10 | 0.5% |
| bgd | 10 | 0.5% |
| alb | 10 | 0.5% |
| dza | 10 | 0.5% |
| and | 10 | 0.5% |
| ago | 10 | 0.5% |
| atg | 10 | 0.5% |
| aze | 10 | 0.5% |
| arg | 10 | 0.5% |
| aus | 10 | 0.5% |
| Other values (195) | 1948 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 470 | 7.6% |
| A | 460 | 7.5% |
| N | 420 | 6.8% |
| M | 399 | 6.5% |
| S | 350 | 5.7% |
| B | 349 | 5.7% |
| L | 340 | 5.5% |
| G | 320 | 5.2% |
| T | 309 | 5.0% |
| C | 279 | 4.5% |
| Other values (16) | 2448 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6144 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 470 | 7.6% |
| A | 460 | 7.5% |
| N | 420 | 6.8% |
| M | 399 | 6.5% |
| S | 350 | 5.7% |
| B | 349 | 5.7% |
| L | 340 | 5.5% |
| G | 320 | 5.2% |
| T | 309 | 5.0% |
| C | 279 | 4.5% |
| Other values (16) | 2448 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6144 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 470 | 7.6% |
| A | 460 | 7.5% |
| N | 420 | 6.8% |
| M | 399 | 6.5% |
| S | 350 | 5.7% |
| B | 349 | 5.7% |
| L | 340 | 5.5% |
| G | 320 | 5.2% |
| T | 309 | 5.0% |
| C | 279 | 4.5% |
| Other values (16) | 2448 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6144 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 470 | 7.6% |
| A | 460 | 7.5% |
| N | 420 | 6.8% |
| M | 399 | 6.5% |
| S | 350 | 5.7% |
| B | 349 | 5.7% |
| L | 340 | 5.5% |
| G | 320 | 5.2% |
| T | 309 | 5.0% |
| C | 279 | 4.5% |
| Other values (16) | 2448 |
year
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 138 |
| Missing (%) | 6.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2016.5039 |
| Minimum | 2012 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 2012 |
|---|---|
| 5-th percentile | 2012 |
| Q1 | 2014 |
| median | 2017 |
| Q3 | 2019 |
| 95-th percentile | 2021 |
| Maximum | 2021 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.8716195 |
|---|---|
| Coefficient of variation (CV) | 0.0014240585 |
| Kurtosis | -1.2232164 |
| Mean | 2016.5039 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.0013177405 |
| Sum | 4129800 |
| Variance | 8.2461987 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 2014 | 205 | |
| 2015 | 205 | |
| 2016 | 205 | |
| 2017 | 205 | |
| 2018 | 205 | |
| 2019 | 205 | |
| 2020 | 205 | |
| 2021 | 205 | |
| 2012 | 204 | |
| 2013 | 204 | |
| (Missing) | 138 |
| Value | Count | Frequency (%) |
| 2012 | 204 | |
| 2013 | 204 | |
| 2014 | 205 | |
| 2015 | 205 | |
| 2016 | 205 | |
| 2017 | 205 | |
| 2018 | 205 | |
| 2019 | 205 | |
| 2020 | 205 | |
| 2021 | 205 |
| Value | Count | Frequency (%) |
| 2021 | 205 | |
| 2020 | 205 | |
| 2019 | 205 | |
| 2018 | 205 | |
| 2017 | 205 | |
| 2016 | 205 | |
| 2015 | 205 | |
| 2014 | 205 | |
| 2013 | 204 | |
| 2012 | 204 |
gdp
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 1985 |
|---|---|
| Distinct (%) | 96.9% |
| Missing | 138 |
| Missing (%) | 6.3% |
| Memory size | 165.5 KiB |
| .. | 64 |
|---|---|
| 284900000 | 1 |
| 401932300 | 1 |
| 436999692591.454 | 1 |
| 421739210176.152 | 1 |
| Other values (1980) |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 15.041504 |
| Min length | 2 |
Characters and Unicode
| Total characters | 30805 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1984 ? |
|---|---|
| Unique (%) | 96.9% |
Sample
| 1st row | 20203572959.5023 |
|---|---|
| 2nd row | 12319830437.3467 |
| 3rd row | 209058991952.125 |
| 4th row | 3188808942.56713 |
| 5th row | 124998210652.243 |
Common Values
| Value | Count | Frequency (%) |
| .. | 64 | 2.9% |
| 284900000 | 1 | < 0.1% |
| 401932300 | 1 | < 0.1% |
| 436999692591.454 | 1 | < 0.1% |
| 421739210176.152 | 1 | < 0.1% |
| 12808660528.0617 | 1 | < 0.1% |
| 13025239912.2751 | 1 | < 0.1% |
| 211953111035.513 | 1 | < 0.1% |
| 914736985.430944 | 1 | < 0.1% |
| 9846922416.14776 | 1 | < 0.1% |
| Other values (1975) | 1975 | |
| (Missing) | 138 | 6.3% |
Length
| Value | Count | Frequency (%) |
| 64 | 3.1% | |
| 4610096000 | 1 | < 0.1% |
| 12319830437.3467 | 1 | < 0.1% |
| 209058991952.125 | 1 | < 0.1% |
| 3188808942.56713 | 1 | < 0.1% |
| 124998210652.243 | 1 | < 0.1% |
| 1199948148.14815 | 1 | < 0.1% |
| 69683935845.2139 | 1 | < 0.1% |
| 545982375701.128 | 1 | < 0.1% |
| 1546892142709.84 | 1 | < 0.1% |
| Other values (1975) | 1975 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3497 | |
| 0 | 3106 | |
| 2 | 3015 | |
| 4 | 2868 | |
| 3 | 2867 | |
| 5 | 2755 | |
| 7 | 2730 | |
| 6 | 2693 | |
| 9 | 2667 | |
| 8 | 2663 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 28861 | |
| Other Punctuation | 1944 | 6.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3497 | |
| 0 | 3106 | |
| 2 | 3015 | |
| 4 | 2868 | |
| 3 | 2867 | |
| 5 | 2755 | |
| 7 | 2730 | |
| 6 | 2693 | |
| 9 | 2667 | |
| 8 | 2663 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1944 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 30805 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 3497 | |
| 0 | 3106 | |
| 2 | 3015 | |
| 4 | 2868 | |
| 3 | 2867 | |
| 5 | 2755 | |
| 7 | 2730 | |
| 6 | 2693 | |
| 9 | 2667 | |
| 8 | 2663 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30805 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3497 | |
| 0 | 3106 | |
| 2 | 3015 | |
| 4 | 2868 | |
| 3 | 2867 | |
| 5 | 2755 | |
| 7 | 2730 | |
| 6 | 2693 | |
| 9 | 2667 | |
| 8 | 2663 |
population
Real number (ℝ)
| Distinct | 2048 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 138 |
| Missing (%) | 6.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36612266 |
| Minimum | 10444 |
|---|---|
| Maximum | 1.41236 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 10444 |
|---|---|
| 5-th percentile | 56176.25 |
| Q1 | 1262415 |
| median | 7145394 |
| Q3 | 25663486 |
| 95-th percentile | 1.247103 × 108 |
| Maximum | 1.41236 × 109 |
| Range | 1.4123496 × 109 |
| Interquartile range (IQR) | 24401072 |
Descriptive statistics
| Standard deviation | 1.3951161 × 108 |
|---|---|
| Coefficient of variation (CV) | 3.8105156 |
| Kurtosis | 78.525556 |
| Mean | 36612266 |
| Median Absolute Deviation (MAD) | 6829983 |
| Skewness | 8.5879579 |
| Sum | 7.4981922 × 1010 |
| Variance | 1.946349 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2873457 | 1 | < 0.1% |
| 37974750 | 1 | < 0.1% |
| 108568836 | 1 | < 0.1% |
| 32203944 | 1 | < 0.1% |
| 6443328 | 1 | < 0.1% |
| 9329227 | 1 | < 0.1% |
| 4165255 | 1 | < 0.1% |
| 219731479 | 1 | < 0.1% |
| 17864 | 1 | < 0.1% |
| 45989 | 1 | < 0.1% |
| Other values (2038) | 2038 | |
| (Missing) | 138 | 6.3% |
| Value | Count | Frequency (%) |
| 10444 | 1 | |
| 10694 | 1 | |
| 10828 | 1 | |
| 10852 | 1 | |
| 10854 | 1 | |
| 10865 | 1 | |
| 10877 | 1 | |
| 10899 | 1 | |
| 10918 | 1 | |
| 10940 | 1 |
| Value | Count | Frequency (%) |
| 1412360000 | 1 | |
| 1411100000 | 1 | |
| 1407745000 | 1 | |
| 1407563842 | 1 | |
| 1402760000 | 1 | |
| 1396387127 | 1 | |
| 1396215000 | 1 | |
| 1387790000 | 1 | |
| 1383112050 | 1 | |
| 1379860000 | 1 |
j
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 192 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 268 |
| Missing (%) | 12.3% |
| Memory size | 137.8 KiB |
| AFG | 10 |
|---|---|
| NZL | 10 |
| NIC | 10 |
| NER | 10 |
| NGA | 10 |
| Other values (187) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 5754 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AFG |
|---|---|
| 2nd row | ALB |
| 3rd row | DZA |
| 4th row | AND |
| 5th row | AGO |
Common Values
| Value | Count | Frequency (%) |
| AFG | 10 | 0.5% |
| NZL | 10 | 0.5% |
| NIC | 10 | 0.5% |
| NER | 10 | 0.5% |
| NGA | 10 | 0.5% |
| NOR | 10 | 0.5% |
| FSM | 10 | 0.5% |
| MHL | 10 | 0.5% |
| PLW | 10 | 0.5% |
| PAK | 10 | 0.5% |
| Other values (182) | 1818 | |
| (Missing) | 268 | 12.3% |
Length
| Value | Count | Frequency (%) |
| afg | 10 | 0.5% |
| arm | 10 | 0.5% |
| and | 10 | 0.5% |
| ago | 10 | 0.5% |
| atg | 10 | 0.5% |
| aze | 10 | 0.5% |
| arg | 10 | 0.5% |
| aus | 10 | 0.5% |
| aut | 10 | 0.5% |
| bhs | 10 | 0.5% |
| Other values (182) | 1818 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 460 | 8.0% |
| A | 440 | 7.6% |
| N | 400 | 7.0% |
| M | 389 | 6.8% |
| B | 329 | 5.7% |
| G | 320 | 5.6% |
| L | 300 | 5.2% |
| T | 299 | 5.2% |
| S | 290 | 5.0% |
| C | 249 | 4.3% |
| Other values (16) | 2278 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5754 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 460 | 8.0% |
| A | 440 | 7.6% |
| N | 400 | 7.0% |
| M | 389 | 6.8% |
| B | 329 | 5.7% |
| G | 320 | 5.6% |
| L | 300 | 5.2% |
| T | 299 | 5.2% |
| S | 290 | 5.0% |
| C | 249 | 4.3% |
| Other values (16) | 2278 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5754 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 460 | 8.0% |
| A | 440 | 7.6% |
| N | 400 | 7.0% |
| M | 389 | 6.8% |
| B | 329 | 5.7% |
| G | 320 | 5.6% |
| L | 300 | 5.2% |
| T | 299 | 5.2% |
| S | 290 | 5.0% |
| C | 249 | 4.3% |
| Other values (16) | 2278 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5754 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 460 | 8.0% |
| A | 440 | 7.6% |
| N | 400 | 7.0% |
| M | 389 | 6.8% |
| B | 329 | 5.7% |
| G | 320 | 5.6% |
| L | 300 | 5.2% |
| T | 299 | 5.2% |
| S | 290 | 5.0% |
| C | 249 | 4.3% |
| Other values (16) | 2278 |
dist
Real number (ℝ)
| Distinct | 189 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 298 |
| Missing (%) | 13.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9047.2762 |
| Minimum | 809.5382 |
|---|---|
| Maximum | 19297.47 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 809.5382 |
|---|---|
| 5-th percentile | 2850.319 |
| Q1 | 6523.571 |
| median | 8390.566 |
| Q3 | 11903.59 |
| 95-th percentile | 14866.92 |
| Maximum | 19297.47 |
| Range | 18487.932 |
| Interquartile range (IQR) | 5380.019 |
Descriptive statistics
| Standard deviation | 3877.9301 |
|---|---|
| Coefficient of variation (CV) | 0.42862956 |
| Kurtosis | -0.29941613 |
| Mean | 9047.2762 |
| Median Absolute Deviation (MAD) | 2650.464 |
| Skewness | 0.2411037 |
| Sum | 17081258 |
| Variance | 15038342 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14352.56 | 10 | 0.5% |
| 11041.03 | 10 | 0.5% |
| 13785.57 | 10 | 0.5% |
| 11024.17 | 10 | 0.5% |
| 11466.06 | 10 | 0.5% |
| 7031.006 | 10 | 0.5% |
| 5548.78 | 10 | 0.5% |
| 6537.865 | 10 | 0.5% |
| 4048.299 | 10 | 0.5% |
| 3882.877 | 10 | 0.5% |
| Other values (179) | 1788 | |
| (Missing) | 298 | 13.6% |
| Value | Count | Frequency (%) |
| 809.5382 | 10 | |
| 955.6511 | 10 | |
| 1172.047 | 10 | |
| 1976.249 | 10 | |
| 1982.745 | 10 | |
| 2098.111 | 10 | |
| 2330.799 | 10 | |
| 2778.652 | 10 | |
| 2812.561 | 10 | |
| 2850.319 | 10 |
| Value | Count | Frequency (%) |
| 19297.47 | 10 | |
| 19175.59 | 10 | |
| 19079.88 | 10 | |
| 18311.35 | 10 | |
| 17614.3 | 10 | |
| 17389.85 | 10 | |
| 16666.29 | 10 | |
| 15364.41 | 10 | |
| 14937.48 | 10 | |
| 14866.92 | 10 |
sum_pos_tweets
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 685 |
|---|---|
| Distinct (%) | 67.3% |
| Missing | 1168 |
| Missing (%) | 53.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13170.081 |
| Minimum | 0 |
|---|---|
| Maximum | 708560 |
| Zeros | 71 |
| Zeros (%) | 3.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 17 |
| median | 488 |
| Q3 | 4194.5 |
| 95-th percentile | 55283.15 |
| Maximum | 708560 |
| Range | 708560 |
| Interquartile range (IQR) | 4177.5 |
Descriptive statistics
| Standard deviation | 55057.135 |
|---|---|
| Coefficient of variation (CV) | 4.1804707 |
| Kurtosis | 74.669214 |
| Mean | 13170.081 |
| Median Absolute Deviation (MAD) | 487 |
| Skewness | 7.9359981 |
| Sum | 13407142 |
| Variance | 3.0312882 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 71 | 3.2% |
| 1 | 54 | 2.5% |
| 2 | 20 | 0.9% |
| 3 | 17 | 0.8% |
| 4 | 14 | 0.6% |
| 5 | 13 | 0.6% |
| 6 | 11 | 0.5% |
| 8 | 8 | 0.4% |
| 7 | 8 | 0.4% |
| 10 | 7 | 0.3% |
| Other values (675) | 795 | |
| (Missing) | 1168 |
| Value | Count | Frequency (%) |
| 0 | 71 | |
| 1 | 54 | |
| 2 | 20 | 0.9% |
| 3 | 17 | 0.8% |
| 4 | 14 | 0.6% |
| 5 | 13 | 0.6% |
| 6 | 11 | 0.5% |
| 7 | 8 | 0.4% |
| 8 | 8 | 0.4% |
| 9 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 708560 | 1 | |
| 646051 | 1 | |
| 570562 | 1 | |
| 510945 | 1 | |
| 510064 | 1 | |
| 445639 | 1 | |
| 431775 | 1 | |
| 332144 | 1 | |
| 283451 | 1 | |
| 281640 | 1 |
count_tweets
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 752 |
|---|---|
| Distinct (%) | 73.9% |
| Missing | 1168 |
| Missing (%) | 53.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30153.134 |
| Minimum | 1 |
|---|---|
| Maximum | 1557851 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 46 |
| median | 1238.5 |
| Q3 | 9314.25 |
| 95-th percentile | 124532.8 |
| Maximum | 1557851 |
| Range | 1557850 |
| Interquartile range (IQR) | 9268.25 |
Descriptive statistics
| Standard deviation | 124806.14 |
|---|---|
| Coefficient of variation (CV) | 4.139077 |
| Kurtosis | 72.637998 |
| Mean | 30153.134 |
| Median Absolute Deviation (MAD) | 1235.5 |
| Skewness | 7.8440606 |
| Sum | 30695890 |
| Variance | 1.5576573 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 47 | 2.2% |
| 2 | 30 | 1.4% |
| 3 | 20 | 0.9% |
| 4 | 14 | 0.6% |
| 5 | 12 | 0.5% |
| 6 | 8 | 0.4% |
| 14 | 8 | 0.4% |
| 12 | 7 | 0.3% |
| 8 | 7 | 0.3% |
| 17 | 6 | 0.3% |
| Other values (742) | 859 | |
| (Missing) | 1168 |
| Value | Count | Frequency (%) |
| 1 | 47 | |
| 2 | 30 | |
| 3 | 20 | |
| 4 | 14 | 0.6% |
| 5 | 12 | 0.5% |
| 6 | 8 | 0.4% |
| 7 | 5 | 0.2% |
| 8 | 7 | 0.3% |
| 9 | 6 | 0.3% |
| 10 | 5 | 0.2% |
| Value | Count | Frequency (%) |
| 1557851 | 1 | |
| 1428281 | 1 | |
| 1360903 | 1 | |
| 1209281 | 1 | |
| 1107731 | 1 | |
| 1004543 | 1 | |
| 969659 | 1 | |
| 796335 | 1 | |
| 628125 | 1 | |
| 626655 | 1 |
sum_political_tweets
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 752 |
|---|---|
| Distinct (%) | 73.9% |
| Missing | 1168 |
| Missing (%) | 53.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30153.134 |
| Minimum | 1 |
|---|---|
| Maximum | 1557851 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 46 |
| median | 1238.5 |
| Q3 | 9314.25 |
| 95-th percentile | 124532.8 |
| Maximum | 1557851 |
| Range | 1557850 |
| Interquartile range (IQR) | 9268.25 |
Descriptive statistics
| Standard deviation | 124806.14 |
|---|---|
| Coefficient of variation (CV) | 4.139077 |
| Kurtosis | 72.637998 |
| Mean | 30153.134 |
| Median Absolute Deviation (MAD) | 1235.5 |
| Skewness | 7.8440606 |
| Sum | 30695890 |
| Variance | 1.5576573 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 47 | 2.2% |
| 2 | 30 | 1.4% |
| 3 | 20 | 0.9% |
| 4 | 14 | 0.6% |
| 5 | 12 | 0.5% |
| 6 | 8 | 0.4% |
| 14 | 8 | 0.4% |
| 12 | 7 | 0.3% |
| 8 | 7 | 0.3% |
| 17 | 6 | 0.3% |
| Other values (742) | 859 | |
| (Missing) | 1168 |
| Value | Count | Frequency (%) |
| 1 | 47 | |
| 2 | 30 | |
| 3 | 20 | |
| 4 | 14 | 0.6% |
| 5 | 12 | 0.5% |
| 6 | 8 | 0.4% |
| 7 | 5 | 0.2% |
| 8 | 7 | 0.3% |
| 9 | 6 | 0.3% |
| 10 | 5 | 0.2% |
| Value | Count | Frequency (%) |
| 1557851 | 1 | |
| 1428281 | 1 | |
| 1360903 | 1 | |
| 1209281 | 1 | |
| 1107731 | 1 | |
| 1004543 | 1 | |
| 969659 | 1 | |
| 796335 | 1 | |
| 628125 | 1 | |
| 626655 | 1 |
sum_likes
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 726 |
|---|---|
| Distinct (%) | 71.3% |
| Missing | 1168 |
| Missing (%) | 53.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 110792.33 |
| Minimum | 0 |
|---|---|
| Maximum | 6919036 |
| Zeros | 109 |
| Zeros (%) | 5.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 17 |
| median | 2131 |
| Q3 | 26538 |
| 95-th percentile | 488111.3 |
| Maximum | 6919036 |
| Range | 6919036 |
| Interquartile range (IQR) | 26521 |
Descriptive statistics
| Standard deviation | 498488.8 |
|---|---|
| Coefficient of variation (CV) | 4.4993079 |
| Kurtosis | 84.355394 |
| Mean | 110792.33 |
| Median Absolute Deviation (MAD) | 2131 |
| Skewness | 8.3926264 |
| Sum | 1.1278659 × 108 |
| Variance | 2.4849108 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 109 | 5.0% |
| 1 | 44 | 2.0% |
| 3 | 20 | 0.9% |
| 2 | 19 | 0.9% |
| 5 | 8 | 0.4% |
| 11 | 8 | 0.4% |
| 8 | 6 | 0.3% |
| 4 | 6 | 0.3% |
| 7 | 6 | 0.3% |
| 23 | 6 | 0.3% |
| Other values (716) | 786 | |
| (Missing) | 1168 |
| Value | Count | Frequency (%) |
| 0 | 109 | |
| 1 | 44 | |
| 2 | 19 | 0.9% |
| 3 | 20 | 0.9% |
| 4 | 6 | 0.3% |
| 5 | 8 | 0.4% |
| 6 | 3 | 0.1% |
| 7 | 6 | 0.3% |
| 8 | 6 | 0.3% |
| 9 | 4 | 0.2% |
| Value | Count | Frequency (%) |
| 6919036 | 1 | |
| 6234509 | 1 | |
| 4990612 | 1 | |
| 4367973 | 1 | |
| 4181543 | 1 | |
| 3828365 | 1 | |
| 3652751 | 1 | |
| 3512024 | 1 | |
| 2857956 | 1 | |
| 2665921 | 1 |
sum_retweeets
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 728 |
|---|---|
| Distinct (%) | 71.5% |
| Missing | 1168 |
| Missing (%) | 53.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43813.677 |
| Minimum | 0 |
|---|---|
| Maximum | 2242006 |
| Zeros | 109 |
| Zeros (%) | 5.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 16 |
| median | 1193.5 |
| Q3 | 12716.5 |
| 95-th percentile | 194828.45 |
| Maximum | 2242006 |
| Range | 2242006 |
| Interquartile range (IQR) | 12700.5 |
Descriptive statistics
| Standard deviation | 185469.94 |
|---|---|
| Coefficient of variation (CV) | 4.2331518 |
| Kurtosis | 68.946242 |
| Mean | 43813.677 |
| Median Absolute Deviation (MAD) | 1193.5 |
| Skewness | 7.749181 |
| Sum | 44602323 |
| Variance | 3.43991 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 109 | 5.0% |
| 1 | 41 | 1.9% |
| 3 | 17 | 0.8% |
| 4 | 13 | 0.6% |
| 2 | 13 | 0.6% |
| 6 | 10 | 0.5% |
| 10 | 8 | 0.4% |
| 7 | 7 | 0.3% |
| 8 | 6 | 0.3% |
| 14 | 6 | 0.3% |
| Other values (718) | 788 | |
| (Missing) | 1168 |
| Value | Count | Frequency (%) |
| 0 | 109 | |
| 1 | 41 | 1.9% |
| 2 | 13 | 0.6% |
| 3 | 17 | 0.8% |
| 4 | 13 | 0.6% |
| 5 | 6 | 0.3% |
| 6 | 10 | 0.5% |
| 7 | 7 | 0.3% |
| 8 | 6 | 0.3% |
| 9 | 4 | 0.2% |
| Value | Count | Frequency (%) |
| 2242006 | 1 | |
| 1911858 | 1 | |
| 1860205 | 1 | |
| 1820632 | 1 | |
| 1786003 | 1 | |
| 1692160 | 1 | |
| 1574074 | 1 | |
| 1391875 | 1 | |
| 1100089 | 1 | |
| 991547 | 1 |
| RefYear | Period | Cifvalue | PrimaryValue | year | population | dist | sum_pos_tweets | count_tweets | sum_political_tweets | sum_likes | sum_retweeets | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RefYear | 1.000 | 1.000 | 0.027 | 0.027 | 1.000 | 0.002 | 0.002 | 0.718 | 0.704 | 0.704 | 0.797 | 0.727 |
| Period | 1.000 | 1.000 | 0.027 | 0.027 | 1.000 | 0.002 | 0.002 | 0.718 | 0.704 | 0.704 | 0.797 | 0.727 |
| Cifvalue | 0.027 | 0.027 | 1.000 | 1.000 | 0.036 | 0.133 | -0.281 | 0.315 | 0.330 | 0.330 | 0.268 | 0.316 |
| PrimaryValue | 0.027 | 0.027 | 1.000 | 1.000 | 0.036 | 0.133 | -0.281 | 0.315 | 0.330 | 0.330 | 0.268 | 0.316 |
| year | 1.000 | 1.000 | 0.036 | 0.036 | 1.000 | 0.002 | 0.002 | 0.718 | 0.704 | 0.704 | 0.797 | 0.727 |
| population | 0.002 | 0.002 | 0.133 | 0.133 | 0.002 | 1.000 | -0.128 | -0.057 | -0.058 | -0.058 | -0.058 | -0.058 |
| dist | 0.002 | 0.002 | -0.281 | -0.281 | 0.002 | -0.128 | 1.000 | -0.093 | -0.095 | -0.095 | -0.075 | -0.098 |
| sum_pos_tweets | 0.718 | 0.718 | 0.315 | 0.315 | 0.718 | -0.057 | -0.093 | 1.000 | 0.997 | 0.997 | 0.979 | 0.983 |
| count_tweets | 0.704 | 0.704 | 0.330 | 0.330 | 0.704 | -0.058 | -0.095 | 0.997 | 1.000 | 1.000 | 0.976 | 0.984 |
| sum_political_tweets | 0.704 | 0.704 | 0.330 | 0.330 | 0.704 | -0.058 | -0.095 | 0.997 | 1.000 | 1.000 | 0.976 | 0.984 |
| sum_likes | 0.797 | 0.797 | 0.268 | 0.268 | 0.797 | -0.058 | -0.075 | 0.979 | 0.976 | 0.976 | 1.000 | 0.984 |
| sum_retweeets | 0.727 | 0.727 | 0.316 | 0.316 | 0.727 | -0.058 | -0.098 | 0.983 | 0.984 | 0.984 | 0.984 | 1.000 |
| RefYear | Period | ReporterISO | FlowDesc | PartnerISO | Cifvalue | PrimaryValue | Key | Country Code | year | gdp | population | j | dist | sum_pos_tweets | count_tweets | sum_political_tweets | sum_likes | sum_retweeets | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2012 | 2012 | CHN | Import | W00 | 1.818199e+12 | 1818199227571 | W00_2012 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1 | 2012 | 2012 | CHN | Import | AFG | 5.186565e+06 | 5186565 | AFG_2012 | AFG | 2012 | 20203572959.5023 | 30466479 | AFG | 4180.438 | NaN | NaN | NaN | NaN | NaN |
| 2 | 2012 | 2012 | CHN | Import | ALB | 1.427209e+08 | 142720886 | ALB_2012 | ALB | 2012 | 12319830437.3467 | 2900401 | ALB | 7686.079 | NaN | NaN | NaN | NaN | NaN |
| 3 | 2012 | 2012 | CHN | Import | DZA | 2.311906e+09 | 2311905609 | DZA_2012 | DZA | 2012 | 209058991952.125 | 37260563 | DZA | 9117.676 | NaN | NaN | NaN | NaN | NaN |
| 4 | 2012 | 2012 | CHN | Import | AND | 3.240020e+05 | 324002 | AND_2012 | AND | 2012 | 3188808942.56713 | 71013 | AND | 8764.593 | NaN | NaN | NaN | NaN | NaN |
| 5 | 2012 | 2012 | CHN | Import | AGO | 3.356190e+10 | 33561896917 | AGO_2012 | AGO | 2012 | 124998210652.243 | 25188292 | AGO | 11769.510 | NaN | NaN | NaN | NaN | NaN |
| 6 | 2012 | 2012 | CHN | Import | ATG | 7.135100e+04 | 71351 | ATG_2012 | ATG | 2012 | 1199948148.14815 | 87674 | ATG | 13681.690 | NaN | NaN | NaN | NaN | NaN |
| 7 | 2012 | 2012 | CHN | Import | AZE | 2.141617e+08 | 214161731 | AZE_2012 | AZE | 2012 | 69683935845.2139 | 9295784 | AZE | 5520.214 | NaN | NaN | NaN | NaN | NaN |
| 8 | 2012 | 2012 | CHN | Import | ARG | 6.560806e+09 | 6560805532 | ARG_2012 | ARG | 2012 | 545982375701.128 | 41733271 | ARG | 19297.470 | NaN | NaN | NaN | NaN | NaN |
| 9 | 2012 | 2012 | CHN | Import | AUS | 8.456821e+10 | 84568208584 | AUS_2012 | AUS | 2012 | 1546892142709.84 | 22733465 | AUS | 8956.436 | NaN | NaN | NaN | NaN | NaN |
| RefYear | Period | ReporterISO | FlowDesc | PartnerISO | Cifvalue | PrimaryValue | Key | Country Code | year | gdp | population | j | dist | sum_pos_tweets | count_tweets | sum_political_tweets | sum_likes | sum_retweeets | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2176 | 2021 | 2021 | CHN | Import | USA | 1.809719e+11 | 180971932243 | USA_2021 | USA | 2021 | 23315080560000 | 331893745 | USA | 10993.680 | NaN | NaN | NaN | NaN | NaN |
| 2177 | 2021 | 2021 | CHN | Import | BFA | 1.884526e+08 | 188452598 | BFA_2021 | BFA | 2021 | 19737615114.3661 | 22100683 | BFA | 11404.370 | 8612.0 | 19080.0 | 19080.0 | 135312.0 | 45960.0 |
| 2178 | 2021 | 2021 | CHN | Import | URY | 3.623471e+09 | 3623470751 | URY_2021 | URY | 2021 | 59319484710.6527 | 3426260 | URY | 19175.590 | 1754.0 | 4007.0 | 4007.0 | 20283.0 | 5608.0 |
| 2179 | 2021 | 2021 | CHN | Import | UZB | 1.540988e+09 | 1540987879 | UZB_2021 | UZB | 2021 | 69238903106.1738 | 34915100 | UZB | 3943.621 | NaN | NaN | NaN | NaN | NaN |
| 2180 | 2021 | 2021 | CHN | Import | VEN | 9.977931e+08 | 997793138 | VEN_2021 | VEN | 2021 | .. | 28199867 | VEN | 14402.500 | 1107.0 | 2434.0 | 2434.0 | 17253.0 | 4293.0 |
| 2181 | 2021 | 2021 | CHN | Import | WLF | 1.475400e+04 | 14754 | WLF_2021 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 2182 | 2021 | 2021 | CHN | Import | WSM | 6.432650e+05 | 643265 | WSM_2021 | WSM | 2021 | 843842416.462442 | 218764 | WSM | 8268.319 | 7.0 | 12.0 | 12.0 | 29.0 | 18.0 |
| 2183 | 2021 | 2021 | CHN | Import | YEM | 4.708126e+08 | 470812557 | YEM_2021 | YEM | 2021 | .. | 32981641 | YEM | 7417.418 | NaN | NaN | NaN | NaN | NaN |
| 2184 | 2021 | 2021 | CHN | Import | ZMB | 4.385251e+09 | 4385251435 | ZMB_2021 | ZMB | 2021 | 22147634727.3584 | 19473125 | ZMB | 10960.790 | 2397.0 | 5458.0 | 5458.0 | 18965.0 | 5874.0 |
| 2185 | 2021 | 2021 | CHN | Import | _X | 2.060227e+09 | 2060227074 | _X _2021 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |